Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 20 de 78
Filter
1.
ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023 ; : 688-693, 2023.
Article in English | Scopus | ID: covidwho-20241249

ABSTRACT

Online misinformation has become a major concern in recent years, and it has been further emphasized during the COVID-19 pandemic. Social media platforms, such as Twitter, can be serious vectors of misinformation online. In order to better understand the spread of these fake-news, lies, deceptions, and rumours, we analyze the correlations between the following textual features in tweets: emotion, sentiment, political bias, stance, veracity and conspiracy theories. We train several transformer-based classifiers from multiple datasets to detect these textual features and identify potential correlations using conditional distributions of the labels. Our results show that the online discourse regarding some topics, such as COVID-19 regulations or conspiracy theories, is highly controversial and reflects the actual U.S. political landscape. © 2023 ACM.

2.
Journal of the Institute of Science & Technology / Fen Bilimleri Estitüsü Dergisi ; 13(2):778-791, 2023.
Article in Turkish | Academic Search Complete | ID: covidwho-20240938

ABSTRACT

The new type of coronavirus disease (COVID-19), which has emerged in recent years, has become a serious disease that threatens health worldwide. COVID-19, which can be transmitted very quickly and with serious increases in death, has paved the way for many concerns. With the spread of the epidemic to a universal dimension, many studies have been carried out for the early diagnosis of this disease. With early diagnosis, both fatal cases are prevented and the planning of the epidemic can be easier. The fact that X-ışını images are much more advantageous than other imaging techniques in terms of time and applicability, and also that they are economical, has led to the focus of early diagnosis-based applications and methods on these images. Deep learning approaches have had a great impact in the diagnosis of COVID-19, as in the diagnosis of many diseases. In this study, we propose a diagnostic system based on the transformer method, which is the most up-to-date and much more popular architecture than previous techniques of deep learning such as CNN-based approaches. This method includes an approach based on vision transformer models and a more effective diagnosis of COVID-19 disease on a new dataset, the COVID-QU-Ex dataset. In experimental studies, it has been observed that vision transformer models are more successful than CNN models. In addition, the ViT-L16 model showed a much higher performance compared to similar studies in the literature, providing test accuracy and F1- score of over 96%. (English) [ FROM AUTHOR] Son yıllarda ortaya çıkan yeni tip Koronavirüs hastalığı (COVID-19), dünya çapında sağlığı tehdit eden ciddi bir hastalık olmuştur. COVID-19 çok hızlı bir şekilde bulaşabilen ve ciddi ölüm artışları ile birçok endişeye zemin hazırlamıştır. Salgının evrensel boyuta taşınmasıyla bu hastalığın erken teşhisine yönelik birçok çalışma yapılmıştır. Erken teşhis ile hem ölümcül vakaların önüne geçilmiş olunmakta hem de salgının planlanması daha kolay olabilmektedir. Xışını görüntülerinin zaman ve uygulanabilirlik açısından diğer görüntüleme tekniklerine nazaran çok daha avantajlı olması ve ayrıca ekonomik olması erken teşhis bazlı uygulama ve yöntemlerin bu görüntülerin üzerine yoğunlaşmasına neden olmuştur. Derin öğrenme yaklaşımları birçok hastalık teşhisinde olduğu gibi COVID-19 teşhisinde de çok büyük bir etki oluşturmuştur. Bu çalışmada, derin öğrenmenin CNN tabanlı yaklaşımları gibi daha önceki tekniklerinden ziyade en güncel ve çok daha popüler bir mimarisi olan transformatör yöntemine dayalı bir teşhis sistemi önerdik. Bu sistem, görü transformatör modelleri temelli bir yaklaşım ve yeni bir veri seti olan COVID-QU-Ex üzerinde COVID-19 hastalığının daha efektif bir teşhisini içermektedir. Deneysel çalışmalarda, görü transformatör modellerinin CNN modellerinden daha başarılı olduğu gözlemlenmiştir. Ayrıca, ViT-L16 modeli %96'nın üzerinde test doğruluğu ve F1-skoru sunarak, literatürde benzer çalışmalara kıyasla çok daha yüksek bir başarım göstermiştir. (Turkish) [ FROM AUTHOR] Copyright of Journal of the Institute of Science & Technology / Fen Bilimleri Estitüsü Dergisi is the property of Igdir University, Institute of Science & Technology and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full . (Copyright applies to all s.)

3.
IEEE Transactions on Consumer Electronics ; : 1-1, 2023.
Article in English | Scopus | ID: covidwho-20234982

ABSTRACT

Recently, crowd counting has attracted significant attention, particularly in the context of the COVID-19 pandemic, due to its ability to automatically provide accurate crowd numbers in images. To address the challenges of location-level labeling, several transformer-based crowd counting methods have been proposed with only count-level supervision. However, these methods directly use the transformer as an encoder without considering the uneven crowd distribution. To address this issue, we propose CCTwins, a novel transformer-based crowd counting method with only count-level supervision. Specifically, we introduce an adaptive scene consistency attention mechanism to enhance the transformer-based model Twins-SVT-L for feature extraction in crowded scenes. Additionally, we design a multi-level weakly-supervised loss function that generates estimated crowd numbers in a coarse-to-fine manner, making it more appropriate for weakly-supervised settings. Moreover, intermediate features supervised by count-level labels are utilized to fuse multi-scale features. Experimental results on four public datasets demonstrate that our proposed method outperforms the state-of-the-art weakly-supervised methods, achieving up to a 16.6% improvement in MAE and up to a 13.8% improvement in RMSE across all evaluation settings. Moreover, the proposed CCTwins obtains competitive counting performance, even when compared to the state-of-the-art fully-supervised methods. IEEE

4.
Computers ; 12(5), 2023.
Article in English | Web of Science | ID: covidwho-20234116

ABSTRACT

The simultaneous advances in deep learning and the Internet of Things (IoT) have benefited distributed deep learning paradigms. Federated learning is one of the most promising frameworks, where a server works with local learners to train a global model. The intrinsic heterogeneity of IoT devices, or non-independent and identically distributed (Non-I.I.D.) data, combined with the unstable communication network environment, causes a bottleneck that slows convergence and degrades learning efficiency. Additionally, the majority of weight averaging-based model aggregation approaches raise questions about learning fairness. In this paper, we propose a peer-to-peer federated learning (P2PFL) framework based on Vision Transformers (ViT) models to help solve some of the above issues and classify COVID-19 vs. normal cases on Chest-X-Ray (CXR) images. Particularly, clients jointly iterate and aggregate the models in order to build a robust model. The experimental results demonstrate that the proposed approach is capable of significantly improving the performance of the model with an Area Under Curve (AUC) of 0.92 and 0.99 for hospital-1 and hospital-2, respectively.

5.
Ieee Transactions on Computational Social Systems ; 2023.
Article in English | Web of Science | ID: covidwho-2328331

ABSTRACT

Social media platforms have become a vital source of information during the outbreak of the pandemic (COVID-19). The phenomena of fake information or news spread through social media have become increasingly prevalent and a powerful tool for information proliferation. Detecting fake news is crucial for the betterment of society. Existing fake news detection models focus on increasing the performance which leads to overfitting and lag generalizability. Hence, these models require training for various datasets of the same domain with significant variations in the distribution. In our work, we have addressed this overfitting issue by designing a robust distribution generalization of transformers-based generative adversarial network (RDGT-GAN) architecture, which can generalize the model for COVID-19 fake news datasets with different distributions without retraining. Based on our experimental findings, it is evident that the proposed model outperforms the current state-of-the-art (SOTA) models in terms of performance.

6.
28th International Computer Conference, Computer Society of Iran, CSICC 2023 ; 2023.
Article in English | Scopus | ID: covidwho-2323458

ABSTRACT

Choosing a proper outfit is one of the problems we deal with every day. Today, people tend to use online websites for shopping, and the COVID-19 situation forced this condition more than before. In this research, we proposed a new architecture for multi-fashion item retrieval from a website database. We deployed a CLIP transformer model instead of convolutional neural networks in a triplet network. We also added a long short-term memory network (LSTM) to automatically extract and code the image features to generate descriptive text for each input image. Our OutCLIP model succeeded in doing its task with 83% precision and 85% recall accuracy in multi-item retrieval. This model can be trained and used in fashion retrieval problems and improve the former proposed models. Considering the descriptive text and the image together gives the model a better understanding of the concept and improves its generalization. © 2023 IEEE.

7.
1st International Conference on Futuristic Technologies, INCOFT 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2316902

ABSTRACT

The small size and inherent superior electrical characteristics of a toroid has made it the first choice for many Original Equipment Manufacturers (OEMs). However, the lack of knowledge regarding the toroidal coil winding equipment is still hampering the growth of toroid as the first choice for transformers, inductors and other electrical applications. Additionally, due to Covid-19 pandemic and lockdown situation, small scale companies are lacking skilled manpower for the high precision task of toroidal core winding and taping. Although the machine is readily available in the market, the cost is still very high. Toroidal core winding machine is an equipment used for the purpose of winding toroidal cores which is used in various electrical machines such as current transformers, power transformers, isolation transformers, inductors and chokes, auto transformers, etc. This project aims to develop a low-cost toroidal winding machine with a user-friendly digital interface for selection of winding parameters as per the user input. The winding machine developed in this project is efficient and reliable with high-speed performance and negligible error. © 2022 IEEE.

8.
Engineering Applications of Artificial Intelligence ; 122, 2023.
Article in English | Web of Science | ID: covidwho-2310316

ABSTRACT

Vision Transformers (ViTs), with the magnificent potential to unravel the information contained within images, have evolved as one of the most contemporary and dominant architectures that are being used in the field of computer vision. These are immensely utilized by plenty of researchers to perform new as well as former experiments. Here, in this article, we investigate the intersection of vision transformers and medical images. We proffered an overview of various ViT based frameworks that are being used by different researchers to decipher the obstacles in medical computer vision. We surveyed the applications of Vision Transformers in different areas of medical computer vision such as image-based disease classification, anatomical structure segmentation, registration, region-based lesion detection, captioning, report generation, and reconstruction using multiple medical imaging modalities that greatly assist in medical diagnosis and hence treatment process. Along with this, we also demystify several imaging modalities used in medical computer vision. Moreover, to get more insight and deeper understanding, the self-attention mechanism of transformers is also explained briefly. Conclusively, the ViT based solutions for each image analytics task are critically analyzed, open challenges are discussed and the pointers to possible solutions for future direction are deliberated. We hope this review article will open future research directions for medical computer vision researchers.

9.
Applied Sciences (Switzerland) ; 13(6), 2023.
Article in English | Scopus | ID: covidwho-2305038

ABSTRACT

The COVID-19 pandemic has been a major global concern in the field of respiratory diseases, with healthcare institutions and partners investing significant resources to improve the detection and severity assessment of the virus. In an effort to further enhance the detection of COVID-19, researchers have investigated the performance of current detection methodologies and proposed new approaches that leverage deep learning techniques. In this article, the authors propose a two-step transformer model for the multi-class classification of COVID-19 images in a patient-aware manner. This model is implemented using transfer learning, which allows for the efficient use of pre-trained models to accelerate the training of the proposed model. The authors compare the performance of their proposed model to other CNN models commonly used in the detection of COVID-19. The experimental results of the study show that CNN-based deep learning networks obtained an accuracy in the range of 0.76–0.92. However, the proposed two-step transformer model implemented with transfer learning achieved a significantly higher accuracy of 0.9735 ± 0.0051. This result indicates that the proposed model is a promising approach to improving the detection of COVID-19. Overall, the findings of this study highlight the potential of deep learning techniques, particularly the use of transfer learning and transformer models, to enhance the detection of COVID-19. These approaches can help healthcare institutions and partners to reduce the time and difficulty in detecting the virus, ultimately leading to more effective and timely treatment for patients. © 2023 by the authors.

10.
12th International Conference on Electrical and Computer Engineering, ICECE 2022 ; : 76-79, 2022.
Article in English | Scopus | ID: covidwho-2297743

ABSTRACT

The vaccination program which helps avert pandemics is facing new hurdles, including the emergence of hazardous new virus strains and public distrust. Analyzing the sentiment expressed in social media interactions related to vaccines may aid the health authority in implementing public safety procedures and guide the government in developing appropriate policies. The purpose of this research is to identify the public sentiments toward the COVID-19 vaccination in Bangladesh from social media comments. Comments posted on social media platforms often mix formal and informal language known as code-mixed text and do not adhere to any particular grammatical standards. In addition, the Bangla language lacks computational models and annotated resources for sentiment analysis. To overcome this, we created CoVaxBD, a Bangla-English code-mixed and sentiment-annotated corpus of Facebook comments. This paper also proposes a model for sentiment analysis based on the multilingual BERT. It achieves a validation accuracy of around 97.3 % and a precision score of approximately 97.4%. © 2022 IEEE.

11.
JAMIA Open ; 6(2): ooad023, 2023 Jul.
Article in English | MEDLINE | ID: covidwho-2306120

ABSTRACT

Objective: To develop and apply a natural language processing (NLP)-based approach to analyze public sentiments on social media and their geographic pattern in the United States toward coronavirus disease 2019 (COVID-19) vaccination. We also aim to provide insights to facilitate the understanding of the public attitudes and concerns regarding COVID-19 vaccination. Methods: We collected Tweet posts by the residents in the United States after the dissemination of the COVID-19 vaccine. We performed sentiment analysis based on the Bidirectional Encoder Representations from Transformers (BERT) and qualitative content analysis. Time series models were leveraged to describe sentiment trends. Key topics were analyzed longitudinally and geospatially. Results: A total of 3 198 686 Tweets related to COVID-19 vaccination were extracted from January 2021 to February 2022. 2 358 783 Tweets were identified to contain clear opinions, among which 824 755 (35.0%) expressed negative opinions towards vaccination while 1 534 028 (65.0%) demonstrated positive opinions. The accuracy of the BERT model was 79.67%. The key hashtag-based topics include Pfizer, breaking, wearamask, and smartnews. The sentiment towards vaccination across the states showed manifest variability. Key barriers to vaccination include mistrust, hesitancy, safety concern, misinformation, and inequity. Conclusion: We found that opinions toward the COVID-19 vaccination varied across different places and over time. This study demonstrates the potential of an analytical pipeline, which integrates NLP-enabled modeling, time series, and geospatial analyses of social media data. Such analyses could enable real-time assessment, at scale, of public confidence and trust in COVID-19 vaccination, help address the concerns of vaccine skeptics, and provide support for developing tailored policies and communication strategies to maximize uptake.

12.
Comput Biol Med ; 159: 106847, 2023 06.
Article in English | MEDLINE | ID: covidwho-2304356

ABSTRACT

BACKGROUND: Convolutional Neural Networks (CNNs) and the hybrid models of CNNs and Vision Transformers (VITs) are the recent mainstream methods for COVID-19 medical image diagnosis. However, pure CNNs lack global modeling ability, and the hybrid models of CNNs and VITs have problems such as large parameters and computational complexity. These models are difficult to be used effectively for medical diagnosis in just-in-time applications. METHODS: Therefore, a lightweight medical diagnosis network CTMLP based on convolutions and multi-layer perceptrons (MLPs) is proposed for the diagnosis of COVID-19. The previous self-supervised algorithms are based on CNNs and VITs, and the effectiveness of such algorithms for MLPs is not yet known. At the same time, due to the lack of ImageNet-scale datasets in the medical image domain for model pre-training. So, a pre-training scheme TL-DeCo based on transfer learning and self-supervised learning was constructed. In addition, TL-DeCo is too tedious and resource-consuming to build a new model each time. Therefore, a guided self-supervised pre-training scheme was constructed for the new lightweight model pre-training. RESULTS: The proposed CTMLP achieves an accuracy of 97.51%, an f1-score of 97.43%, and a recall of 98.91% without pre-training, even with only 48% of the number of ResNet50 parameters. Furthermore, the proposed guided self-supervised learning scheme can improve the baseline of simple self-supervised learning by 1%-1.27%. CONCLUSION: The final results show that the proposed CTMLP can replace CNNs or Transformers for a more efficient diagnosis of COVID-19. In addition, the additional pre-training framework was developed to make it more promising in clinical practice.


Subject(s)
COVID-19 Testing , COVID-19 , Humans , COVID-19/diagnostic imaging , Neural Networks, Computer , Algorithms , Endoscopy
13.
Soc Netw Anal Min ; 13(1): 71, 2023.
Article in English | MEDLINE | ID: covidwho-2290479

ABSTRACT

Since the COVID-19 pandemic, healthcare services, particularly remote and automated healthcare consultations, have gained increased attention. Medical bots, which provide medical advice and support, are becoming increasingly popular. They offer numerous benefits, including 24/7 access to medical counseling, reduced appointment wait times by providing quick answers to common questions or concerns, and cost savings associated with fewer visits or tests required for diagnosis and treatment plans. The success of medical bots depends on the quality of their learning, which in turn depends on the appropriate corpus within the domain of interest. Arabic is one of the most commonly used languages for sharing users' internet content. However, implementing medical bots in Arabic faces several challenges, including the language's morphological composition, the diversity of dialects, and the need for an appropriate and large enough corpus in the medical domain. To address this gap, this paper introduces the largest Arabic Healthcare Q &A dataset, called MAQA, consisting of over 430,000 questions distributed across 20 medical specializations. Furthermore, this paper adopts three deep learning models, namely LSTM, Bi-LSTM, and Transformers, for experimenting and benchmarking the proposed corpus MAQA. The experimental results demonstrate that the recent Transformer model outperforms the traditional deep learning models, achieving an average cosine similarity of 80.81% and a BLeU score of 58%.

14.
2nd International Conference on Computers and Automation, CompAuto 2022 ; : 119-123, 2022.
Article in English | Scopus | ID: covidwho-2268883

ABSTRACT

Proposed and developed 5 years ago, Transformer has been a prevailing machine learning method and is widely used to solve various kinds of practical problems [1]. According to relevant works, Transformer has performed well in both natural language processing and computer vision tasks, so we would like to test its effectiveness in prediction, specifically, time series prediction. Over the past two years, COVID-19 is no doubt one of the major factors that influences the changes in the stock prices, and the medical industry should be among the most significantly affected, which would provide an ideal sample for us to study transformer on time series prediction. In this paper, we not only construct a machine learning model using Transformer to predict the stock prices of one medical company but also add a convolution layer to try to optimize the predictions. The comparison of the outcome from the two models suggests that the convolution layer could improve the performance of the naive transformer in several ways. © 2022 IEEE.

15.
21st International Conference on Artificial Intelligence and Soft Computing, ICAISC 2022 ; 13588 LNAI:61-71, 2023.
Article in English | Scopus | ID: covidwho-2266637

ABSTRACT

Traditional approaches to financial asset allocation start with returns forecasting followed by an optimization stage that decides the optimal asset weights. Any errors made during the forecasting step reduce the accuracy of the asset weightings, and hence the profitability of the overall portfolio. The Portfolio Transformer (PT) network, introduced here, circumvents the need to predict asset returns and instead directly optimizes the Sharpe ratio, a risk-adjusted performance metric widely used in practice. The PT is a novel end-to-end portfolio optimization framework, inspired by the numerous successes of attention mechanisms in natural language processing. With its full encoder-decoder architecture, specialized time encoding layers, and gating components, the PT has a high capacity to learn long-term dependencies among portfolio assets and hence can adapt more quickly to changing market conditions such as the COVID-19 pandemic. To demonstrate its robustness, the PT is compared against other algorithms, including the current LSTM-based state of the art, on three different datasets, with results showing that it offers the best risk-adjusted performance. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.

16.
9th International Symposium on Applied Computing for Software and Smart systems, ACSS 2022 ; 555:227-234, 2023.
Article in English | Scopus | ID: covidwho-2261125

ABSTRACT

Stress is one of the major health issues of the world and one of the major reasons for committing suicide. Also, it leads to other mental health issues such as depression, anxiety etc., and damage to organs related to respiratory, cardiovascular and nervous systems. In recent years, stress has impacted many individuals due to the pandemic situation. Since the governments across the globe had started to impose lockdowns, the levels of stress significantly raised because of the disturbances led by covid infections, losing loved ones, continuous engagement with laptops and mobiles etc. It is also found that stress has not only disturbed the health condition but also disturbed the relationships and became a self-destruction component. This project is aimed to help those people to understand their stress and consult a psychologist at right time to overcome the situation. Though stress is an active area of research and achieved high performance of models, those were based on signal and speech which were computationally costlier and text-based research work using a state-of-the-art model called the BERT has achieved an f1-score i.e. 80.65%. This project focuses on text-domain and uses open-sourced Stress Analysis on Social Media dataset available on Kaggle which contains 3.6 K samples. In this project, both Machine Learning and Deep Learning Models were trained with 80% of the data and validated with 20% of the data. After, optimization and evaluation of several models, the best model has achieved a benchmark result of 83.74% f1-score on test data using a new network architecture i.e. combination of stacked Transformer Encoder layers with stacked Bi-directional-LSTM. In addition to this, an explainable AI has been implemented for an embedding layer to inspect input attributions in predicting the results. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

17.
Information & Management ; 59(2):1-18, 2022.
Article in English | APA PsycInfo | ID: covidwho-2254327

ABSTRACT

This study investigates customer satisfaction through aspect-level sentiment analysis and visual analytics. We collected and examined the flight reviews on TripAdvisor from January 2016 to August 2020 to gauge the impact of COVID-19 on passenger travel sentiment in several aspects. Till now, information systems, management, and tourism research have paid little attention to the use of deep learning and word embedding techniques, such as bidirectional encoder representations from transformers, especially for aspect-level sentiment analysis. This paper aims to identify perceived aspect-based sentiments and predict unrated sentiments for various categories to address this research gap. Ultimately, this study complements existing sentiment analysis methods and extends the use of data-driven and visual analytics approaches to better understand customer satisfaction in the airline industry and within the context of the COVID-19. Our proposed method outperforms baseline comparisons and therefore contributes to the theoretical and managerial literature. (PsycInfo Database Record (c) 2023 APA, all rights reserved)

18.
Data ; 8(3), 2023.
Article in English | Scopus | ID: covidwho-2288144

ABSTRACT

To address the COVID-19 situation in Indonesia, the Indonesian government has adopted a number of policies. One of them is a vacation-related policy. Government measures with regard to this vacation policy have produced a wide range of viewpoints in society, which have been extensively shared on social media, including YouTube. However, there has not been any computerized system developed to date that can assess people's social media reactions. Therefore, this paper provides a sentiment analysis application to this government policy by employing a bidirectional encoder representation from transformers (BERT) approach. The study method began with data collecting, data labeling, data preprocessing, BERT model training, and model evaluation. This study created a new dataset for this topic. The data were collected from the comments section of YouTube, and were categorized into three categories: positive, neutral, and negative. This research yielded an F-score of 84.33%. Another contribution from this study regards the methodology for processing sentiment analysis in Indonesian. In addition, the model was created as an application using the Python programming language and the Flask framework. The government can learn the extent to which the public accepts the policies that have been implemented by utilizing this research. © 2023 by the authors.

19.
Applied Sciences ; 13(5):3116, 2023.
Article in English | ProQuest Central | ID: covidwho-2283057

ABSTRACT

Simple SummaryThe idea of identifying persons using the fewest traits from the face, particularly the area surrounding the eye, was carried out in light of the present COVID-19 scenario. This may also be applied to doctors working in hospitals, the military, and even in certain faiths where the face is mostly covered, except the eyes. The most recent advancement in computer vision, called vision transformers, has been tested for the UBIPr dataset for different architectures. The proposed model is pretrained on an openly available ImageNet dataset with 1 K classes and 1.3 M pictures before using it on the real dataset of interest, and accordingly the input images are scaled to 224 × 224. The PyTorch framework, which is particularly helpful for creating complicated neural networks, has been utilized to create our models. To avoid overfitting, the stratified K-Fold technique is used to make the model less prone to overfitting. The accuracy results have proven that these techniques are highly effective for both person identification and gender classification.AbstractMany biometrics advancements have been widely used for security applications. This field's evolution began with fingerprints and continued with periocular imaging, which has gained popularity due to the pandemic scenario. CNN (convolutional neural networks) has revolutionized the computer vision domain by demonstrating various state-of-the-art results (performance metrics) with the help of deep-learning-based architectures. The latest transformation has happened with the invention of transformers, which are used in NLP (natural language processing) and are presently being adapted for computer vision. In this work, we have implemented five different ViT- (vision transformer) based architectures for person identification and gender classification. The experiment was performed on the ViT architectures and their modified counterparts. In general, the samples selected for train:val:test splits are random, and the trained model may get affected by overfitting. To overcome this, we have performed 5-fold cross-validation-based analysis. The experiment's performance matrix indicates that the proposed method achieved better results for gender classification as well as person identification. We also experimented with train-val-test partitions for benchmarking with existing architectures and observed significant improvements. We utilized the publicly available UBIPr dataset for performing this experimentation.

20.
8th IEEE International Conference on Collaboration and Internet Computing, CIC 2022 ; : 82-88, 2022.
Article in English | Scopus | ID: covidwho-2283041

ABSTRACT

The Covid-19 pandemic has caused a dramatic and parallel rise in dangerous misinformation, denoted an 'infodemic' by the CDC and WHO. Misinformation tied to the Covid-19 infodemic changes continuously;this can lead to performance degradation of fine-tuned models due to concept drift. Degredation can be mitigated if models generalize well-enough to capture some cyclical aspects of drifted data. In this paper, we explore generalizability of pre-trained and fine-tuned fake news detectors across 9 fake news datasets. We show that existing models often overfit on their training dataset and have poor performance on unseen data. However, on some subsets of unseen data that overlap with training data, models have higher accuracy. Based on this observation, we also present KMeans-Proxy, a fast and effective method based on K-Means clustering for quickly identifying these overlapping subsets of unseen data. KMeans-Proxy improves generalizability on unseen fake news datasets by 0.1-0.2 f1-points across datasets. We present both our generalizability experiments as well as KMeans-Proxy to further research in tackling the fake news problem. © 2022 IEEE.

SELECTION OF CITATIONS
SEARCH DETAIL